Image Transformer
نویسندگان
چکیده
Image generation has been successfully cast as an autoregressive sequence generation or transformation problem. Recent work has shown that self-attention is an effective way of modeling textual sequences. In this work, we generalize a recently proposed model architecture based on self-attention, the Transformer, to a sequence modeling formulation of image generation with a tractable likelihood. By restricting the selfattention mechanism to attend to local neighborhoods we significantly increase the size of images the model can process in practice, despite maintaining significantly larger receptive fields per layer than typical convolutional neural networks. We propose another extension of self-attention allowing it to efficiently take advantage of the two-dimensional nature of images. While conceptually simple, our generative models significantly outperform the current state of the art in image generation on ImageNet, improving the best published negative log-likelihood on ImageNet from 3.83 to 3.77. We also present results on image super-resolution with a large magnification ratio, applying an encoder-decoder configuration of our architecture. In a human evaluation study, we show that our super-resolution models improve significantly over previously published super-resolution models. Images generated by the model fool human observers three times more often than the previous state of the art.
منابع مشابه
A Fast Method for Calculation of Transformers Leakage Reactance Using Energy Technique
Energy technique procedure for computing the leakage reactance in transformers is presented. This method is very efficient compared with the use of flux element and image technique and is also remarkably accurate. Examples of calculated leakage inductances and the short circuit impedance are given for illustration. For validation, the results are compared with the results obtained using practic...
متن کاملCorrection of Lens-Distortion for Real-Time Image Processing Systems
This paper presents the design of a real-time imaging system, with incorporated the correction of lens-distorted images. It may be used in medical applications (e.g. real-time X-ray image intensifiers), industrial robot vision products or consumer electronics. The system contains two different VLSI-circuits: a transformer and an interpolator. The transformer calculates an address that points to...
متن کاملRemote Monitoring System For A Switchable Distribution Transformer By The Use Of Wireless ZigBee Technology
This paper proposes a wireless ZigBee technology to monitor the parameters of the transformer. The transformer parameters such as voltage, current, power factor and temperature can be monitored through wireless ZigBee technology. Embedded Ethernet is used to develop client and server applications. Acquisition of voltages, currents, temperatures, active and reactive power, controlling the switch...
متن کاملA Novel Approach of Transformer Oil Quality Analysis Using Image Processing
Electrical energy is the paramount need in a nation’s development. To cater for large demand for electricity there is a need for reliable and proficient power system. For a power system to work reliably, the role of Transformers is critical. Health of the transformer mainly depends on its insulation. Among the different insulating material used in transformers, mineral oil is the most widely us...
متن کاملHierarchical Spatial Transformer Network
Computer vision researchers have been expecting that neural networks have spatial transformation ability to eliminate the interference caused by geometric distortion for a long time. Emergence of spatial transformer network makes dream come true. Spatial transformer network and its variants can handle global displacement well, but lack the ability to deal with local spatial variance. Hence how ...
متن کاملDense Transformer Networks
The key idea of current deep learning methods for dense prediction is to apply a model on a regular patch centered on each pixel to make pixel-wise predictions. These methods are limited in the sense that the patches are determined by network architecture instead of learned from data. In this work, we propose the dense transformer networks, which can learn the shapes and sizes of patches from d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1802.05751 شماره
صفحات -
تاریخ انتشار 2018